Study and Implementation of Monolingual Approach on Indonesian Question Answering for Factoid and Non-Factoid Question
نویسندگان
چکیده
We developed an open domain QA system that can handle factoid and nonfactoid questions in Indonesian language by using monolingual approaches. EAT classification is done by identifying question word and clue words. Keyword extraction from question is done by looking at POS information of each word in question, eliminating stop words, and stemming. We use articles from Indonesian Wikipedia as corpus and Lucene framework as the base for passage retriever component, with three additional processing: query expansion, boost EAT, and boost term. For factoid questions, answer finding is done by using Named Entity Recognition. Answer scoring is done by calculating keyword occurrences and answer-keywords distance (MRR = 0.6191). For non-factoid questions, answer finding is done by identifying sentence pattern and clue words. Answer scoring is done by considering pattern priority and keyword occurrences (MRR = 0.8079).
منابع مشابه
ارایه یک پیکره پرسش و پاسخ مذهبی در زبان فارسی
Question answering system is a field in natural language processing and information retrieval noticed by researchers in these decades. Due to a growing interest in this field of research, the need to have appropriate data sources is perceived. Most researches about developing question answering corpus area have been done in English so far, but in other languages as Persian, the lack of these co...
متن کاملThe Role of Lexical Features in Question Answering for Spanish
This paper describes the prototype developed in the Language Technologies Laboratory at INAOE for the Spanish monolingual QA evaluation task at CLEF 2005. The proposed approach copes with the QA task according to the type of question to solve (factoid or definition). In order to identify possible answers to factoid questions, the system applies a methodology centered in the use of lexical featu...
متن کاملAnalysis of Semantic Classes: Toward Non-Factoid Question Answering
Analysis of Semantic Classes: Toward Non-Factoid Question Answering Yun Niu Doctor of Philosophy Graduate Department of Computer Science University of Toronto 2007 The task of question answering (QA) is to find the accurate and precise answer to a natural language question in some predefined text. Most existing QA systems handle fact-based questions that usually take named entities as the answe...
متن کاملSpinning Straw into Gold: Using Free Text to Train Monolingual Alignment Models for Non-factoid Question Answering
Monolingual alignment models have been shown to boost the performance of question answering systems by ”bridging the lexical chasm” between questions and answers. The main limitation of these approaches is that they require semistructured training data in the form of question-answer pairs, which is difficult to obtain in specialized domains or lowresource languages. We propose two inexpensive m...
متن کاملA Monolithic Approach and a Type-by-Type Approach for Non-Factoid Question-answering - Yokohama National University at NTCIR-6 QAC
In order to process non-factoid questions in NTCIR6 QAC, we introduced two types of approaches. First one has a monolithic architecture that retrieves answer passages related to a question using lexical chain. The other one has a type-by-type architecture and consists of four subsystems: i) the subsystem for definitional and other-type questions, ii) the subsystem for why-type questions, iii) t...
متن کامل